The Influence of the Utterance Length on the Recognition of Aged Voices
نویسندگان
چکیده
This paper addresses the recognition of elderly callers based on short and narrow-band utterances, which are typical for Interactive Voice Response (IVR) systems. Our study is based on 2308 short utterances from a deployed IVR application. We show that features such as speaking rate, jitter and shimmer that are considered as most meaningful ones for determining elderly users underperform when used in the IVR context while pitch and intensity features seem to gain importance. We further demonstrate the influence of the utterance length on the classifier’s performance: for both humans and classifier, the distinction between aged and non-aged voices becomes increasingly difficult the shorter the utterances get. Our setup based on a Support Vector Machine (SVM) with linear kernel reaches a comparably poor performance of 58% accuracy, which can be attributed to an average utterance length of only 1.6 seconds. The automatic distinction between aged and non-aged utterances drops to random when the utterance length falls below 1.2 seconds.
منابع مشابه
Evaluation of Effects of Gradual Increase Length and Complexity of Utterance (GILCU) Treatment Method on the Reduction of Dysfluency in School-Aged Children with Stuttering
Objectives: The Gradual Increase Length and Complexity of Utterance (GILCU) therapy method is a form of operant conditioning. This is a precise and controlled treatment that is done in 54 steps in 3 speech situations consisting of monologue, reading, and conversation. This study aimed at examining the effects of GILCU treatment method on the reduction of speech dysfluency of scho...
متن کاملEffects of nasality and utterance length on the recognition of familiar speakers
The present study examines the effects of nasality and utterance length on memory of familiar speakers using the technique of voice line-ups. With this technique, presented speakers have similar speech F0, dialect, and age range, and they utter the same material. Sets of voice line-ups were elaborated each containing 10 male voices (1 target “familiar” voice and 9 “filler” voices). In each set,...
متن کاملRecognizing the Emotional State Changes in Human Utterance by a Learning Statistical Method based on Gaussian Mixture Model
Speech is one of the most opulent and instant methods to express emotional characteristics of human beings, which conveys the cognitive and semantic concepts among humans. In this study, a statistical-based method for emotional recognition of speech signals is proposed, and a learning approach is introduced, which is based on the statistical model to classify internal feelings of the utterance....
متن کاملGrammatical Competence Development of Nursery School Children Acquiring Persian
This qualitative study is conducted to answer four questions: First, whether there is a difference between the grammatical competence development of a group of children aged 2.6 (two years and six months) and a group of children aged 3.6 (three years and six months). Second, whether there is a significant difference between the two age groups concerning their Mean Length of Utterance (MLU). Thi...
متن کاملاثر طول گفته بر روانی گفتار خودانگیخته کودکان و بزرگسالان لکنتی فارسی زبان
Objective: recently, researchers have increasingly turned to study the relation between stuttering and utterance length. This study investigates the effect of utterance length on the amount of speech dysfluency in stuttering Persian-speaking children and adults in conversational speech. The obtained results can pave the way to reach a better understanding of stuttering of child and adults, as w...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010